A Regression Model for Count Data with Observation-Level Dispersion
Authors
Abstract
While Poisson regression is a popular tool for modeling count data, it is limited by its associated model assumptions. One assumption is that the response variable follows a Poisson distribution. However, over- or under-dispersion is common in practice and is not accommodated by Poisson regression. In addition, the dispersion is assumed fixed across observations, whereas in practice dispersion may vary across groups or according to some other factor. Recently, Sellers and Shmueli (2008) introduced the Conway-Maxwell-Poisson (CMP) regression, based on the CMP distribution. CMP regression generalizes both Poisson and logistic regression models and allows for over- or under-dispersed count data. That model structure, however, assumes a fixed dispersion level across all observations. In this paper, we extend the CMP regression model to account for observation-level dispersion. We discuss model estimation, inference, diagnostics, and interpretation, and present a variable selection technique. We then compare our model to several alternatives and illustrate its advantages and usefulness using datasets with varying types and levels of dispersion.
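As a rough illustration of the idea in the abstract, the sketch below evaluates a CMP log-likelihood in which both the rate and the dispersion vary by observation. The CMP probability mass function is P(Y = y) = lambda^y / ((y!)^nu Z(lambda, nu)), with Z(lambda, nu) the sum over j >= 0 of lambda^j / (j!)^nu; nu = 1 gives the Poisson distribution, nu < 1 over-dispersion, and nu > 1 under-dispersion. The log links for both lambda_i and nu_i, the truncation of Z at max_terms, and all function and parameter names (cmp_log_z, cmp_log_pmf, cmp_log_likelihood, beta, gamma, X, G) are assumptions made for this example, not details taken from the paper.

```python
import math


def cmp_log_z(lam, nu, max_terms=200):
    """Log of Z(lam, nu) = sum_j lam^j / (j!)^nu, truncated at max_terms terms.

    The truncation is an assumption of this sketch; it is adequate for
    moderate lam but not for very large rates or nu close to 0 with lam >= 1.
    """
    log_terms = [j * math.log(lam) - nu * math.lgamma(j + 1)
                 for j in range(max_terms)]
    m = max(log_terms)  # log-sum-exp for numerical stability
    return m + math.log(sum(math.exp(t - m) for t in log_terms))


def cmp_log_pmf(y, lam, nu, max_terms=200):
    """Log CMP probability of count y with rate lam and dispersion nu."""
    return y * math.log(lam) - nu * math.lgamma(y + 1) - cmp_log_z(lam, nu, max_terms)


def cmp_log_likelihood(beta, gamma, X, G, y, max_terms=200):
    """Log-likelihood with observation-level rate and dispersion.

    Assumed links (hypothetical for this sketch):
        log(lam_i) = x_i . beta     (rows of X)
        log(nu_i)  = g_i . gamma    (rows of G)
    """
    total = 0.0
    for x_i, g_i, y_i in zip(X, G, y):
        lam_i = math.exp(sum(b * x for b, x in zip(beta, x_i)))
        nu_i = math.exp(sum(c * g for c, g in zip(gamma, g_i)))
        total += cmp_log_pmf(y_i, lam_i, nu_i, max_terms)
    return total
```

With G set to a single column of ones, the dispersion is constant across observations, which corresponds to the fixed-dispersion setting that the abstract describes as the starting point. In practice the coefficients would be obtained by maximizing this function with a numerical optimizer.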
Similar resources
Application of the Generalized Poisson Regression Model in the Analysis of Fertility Data of Rural Women in Fars Province
Background & objectives: Statistical modeling explains the observed changes in data by means of mathematical equations. When the dependent variable is a count, the Poisson model is applied. If the Poisson model is not applicable in a specific situation, the generalized Poisson model is a better choice. Our emphasis in this study is therefore on the data structure, introducing the generalized Po...
Hurdle, Inflated Poisson and Inflated Negative Binomial Regression Models for Analysis of Count Data with Extra Zeros
In this paper, we propose hurdle regression models for analysing count responses with extra zeros. Model parameters are estimated by maximum likelihood. The proposed model is applied to an insurance dataset in which a large number of claims are equal to zero, which clarifies the application of the model with a zero-inflat...
Estimation of Count Data using Bivariate Negative Binomial Regression Models
The negative binomial regression model (NBR) is a popular approach for modeling overdispersed count data with covariates. Several parameterizations of NBR have been proposed, and the two well-known models, negative binomial-1 regression model (NBR-1) and negative binomial-2 regression model (NBR-2), have been applied. Another parameterization of NBR is negative binomial-P regression mode...
Performance of Generalized Poisson Regression Model and Negative Binomial Regression Model in case of Over-dispersion Count Data
This paper presents a comparison between the Negative Binomial Regression model and the Generalized Poisson Regression model for over-dispersed count data. For this comparison, we used BDHS 2007 data, in which the response variable is the total number of children ever born, which is a count. When the response variable is a count, the Poisson Regression Model as a Generalized Linear Model is widely and pop...
Zero-inflated generalized Poisson models with regression effects on the mean, dispersion and zero-inflation level applied to patent outsourcing rates
This paper focuses on an extension of zero-inflated generalized Poisson (ZIGP) regression models for count data. We discuss generalized Poisson (GP) models where dispersion is modelled by an additional model parameter. Moreover, zero-inflated models in which overdispersion is assumed to be caused by an excessive number of zeros are discussed. In addition to ZIGP regression introduced by Famoye ...
Publication date: 2009